For statistical systems that violate one of the four Shannon-Khinchin axioms, entropy takes a more general form than the Boltzmann-Gibbs entropy. The framework of superstatistics allows one to formulate a maximum entropy principle with these generalized entropies, making them useful for understanding distribution functions of non-Markovian or non-ergodic complex systems. For such systems where the composability axiom is violated there exist only two ways to implement the maximum entropy principle, one using escort probabilities, the other not. The two ways are connected through a duality. Here we show that this duality fixes a unique escort probability, which allows us to derive a complete theory of the generalized logarithms that naturally arise from the violation of this axiom. We then show how the functional forms of these generalized logarithms are related to the asymptotic scaling behavior of the entropy.

thermodynamics | entropy | classical statistical mechanics | correlated systems 

The concept of superstatistics [1] [2] [3] provides a formal framework for a wide class of generalizations of statistical mechanics that were introduced recently. Within this framework it is possible to formulate a maximum entropy principle, even for non-ergodic or non-Markovian systems, including many complex systems. From an axiomatic point of view, non-additive systems are characterized by the fact that the fourth Shannon-Khinchin (SK) axiom$^1$, governing composability of statistical systems, is violated. For systems where all four SK axioms hold, the entropy is uniquely determined as the Boltzmann-Gibbs-Shannon entropy [4] [5], $S_{BGS} = -k \sum_i p_i \log p_i$. In the case where only the first three axioms are valid (e.g. non-Markovian systems) the entropy has a more general form [6]. In the thermodynamic limit, which captures the asymptotic behavior for small values of the $p_i$, the entropy is given by the formula



$$S_{c,d} \propto \sum_{i=1}^{W} \Gamma(1 + d,\, 1 - c \log p_i) , \tag{1}$$



where $\Gamma$ is the incomplete gamma function and $(c, d)$ are constants that are uniquely determined by the scaling properties of the statistical system in its thermodynamic limit. In previous work [7] we were able to show that for systems where the first three SK axioms hold, there exist only two ways to formulate a consistent maximum entropy principle. Starting with an entropy of "trace form"



$$S[p] = \sum_{i=1}^{W} s(p_i) , \tag{2}$$



the maximization condition becomes $\delta\Phi/\delta p_i = 0$ with

$$\Phi[p] = S[p] - \alpha \sum_i p_i - \beta \sum_i Q_i[p]\, \epsilon_i , \tag{3}$$



where the last two terms are the constraints. The first of the two possible approaches (HT approach) [8] [9] uses a generalized entropy and the usual form of the constraint, $Q_i^{HT}[p] = p_i$. The other approach (TS approach), suggested in [10], uses a generalized entropy and a more general way to impose constraints



$$Q_i^{TS}[p,s] = P_i[p,s] = \frac{p_i + \nu\, s(p_i)}{\sum_j \left[ p_j + \nu\, s(p_j) \right]} . \tag{4}$$



$P_i$ is a so-called escort probability and $\nu$ is a real number. While in the HT case the constraint has the usual interpretation as an energy constraint, we do not attempt to give a physical interpretation of the escort probabilities. The two approaches have been shown to be connected by a duality map $* : S_{HT} \leftrightarrow S_{TS}$, with $**$ (meaning applying $*$ twice) being the identity [7]. A special case of this duality has been observed in [11].
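The escort probabilities of Eq. (4) can be illustrated with a short numerical sketch. The function name `escort`, the sample distribution, the value of `nu`, and in particular the choice of the Boltzmann-Gibbs-Shannon summand $s(x) = -x \log x$ are assumptions made only for illustration; the text leaves $s$ general.

```python
import math

def escort(p, s, nu):
    """Escort probabilities P_i = (p_i + nu*s(p_i)) / sum_j (p_j + nu*s(p_j)), cf. Eq. (4)."""
    weights = [pi + nu * s(pi) for pi in p]
    total = sum(weights)
    return [w / total for w in weights]

def s_bgs(x):
    # Illustrative summand only (an assumption): s(x) = -x log x, with s(0) = 0.
    return -x * math.log(x) if x > 0 else 0.0

p = [0.5, 0.3, 0.2]          # hypothetical distribution
P = escort(p, s_bgs, nu=0.4)  # hypothetical value of nu
print(P, sum(P))
```

By construction the $P_i$ sum to one for any admissible $s$ and $\nu$, and $\nu = 0$ returns the original distribution, which corresponds to the HT form of the constraint.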

Entropies can be conveniently formulated using their associated generalized logarithms. We first specify the space $\mathcal{L}$ of proper generalized logarithms $\Lambda \in \mathcal{L}$. We consider a generalized logarithm to be proper if the following properties hold:

• $\Lambda$ is a differentiable function $\Lambda : \mathbb{R}^+ \to \mathbb{R}$. This is necessary for a finite second derivative of the entropy.

• $\Lambda$ is monotonically increasing, which is a consequence of the second SK axiom.

• $\Lambda(1) = 0$ captures the requirement that the entropy of single-state systems is 0.

• $\Lambda'(1) = 1$ is needed to fix the units of entropy.

In both approaches (HT and TS) there exist proper generalized logarithms $\Lambda_{HT}$ and $\Lambda_{TS}$ such that



$$s_{HT}(p_i) = -k \int_0^{p_i} dx\, \Lambda_{HT}(x/x_0) , \tag{5}$$

and

$$s_{TS,\nu}(p_i) = -k \int_0^{p_i} dx\, \Lambda_{TS,\nu}(x/x_0) , \tag{6}$$



with $x_0$ a constant. If both approaches predict the same distribution function $p = \{p_i\}_{i=1}^{W}$ as a result of the maximization of Eq. (3), then it can be shown that the two entropic functions $s_{HT}$ and $s_{TS}$ are one-to-one related by



$$\frac{1}{\Lambda_{TS,\nu}(x)} - \frac{1}{\Lambda_{HT}(x)} = k\nu . \tag{7}$$
1 Shannon-Khinchin axioms: (i) Entropy is a continuous function of the probabilities $p_i$ only, i.e. $s$ should not explicitly depend on any other parameters. (ii) Entropy is maximal for the equi-distribution $p_i = 1/W$. From this the concavity of $s$ follows. (iii) Adding a state $W + 1$ to a system with $p_{W+1} = 0$ does not change the entropy of the system. From this $s(0) = 0$ follows. (iv) Entropy of a system composed of 2 sub-systems $A$ and $B$ is $S(A + B) = S(A) + S(B|A)$.



In the following we set k = 1. This can be achieved either by 
choosing physical units accordingly, or by simply absorbing k 
into v, so that v becomes a dimensionless parameter. 

The full implications of Eq. (7), which is related to the essence of this paper, can be summarized as follows. The statistical properties of a physical system, for instance a superstatistical system as discussed in [7], uniquely determine the entropy $S_{HT}$. A priori, there exists a spectrum of TS-entropies, $S_{TS,\nu}$, whose boundaries are determined by the properties of the generalized logarithm associated with $S_{HT}$. Moreover, these properties determine a particular value $\nu^*$, so that $S_{TS,\nu^*}$ and $S_{HT}$ become a pair of dual entropies. This unique duality allows us to derive a complete theory of generalized logarithms naturally arising as a consequence of the fourth SK axiom being violated. We present a full understanding of how the TS and the HT approaches are interrelated and derive the most general form of families of generalized logarithms that are compatible with a maximum entropy principle and the first three SK axioms. Finally, we demonstrate how these logarithms can be classified according to their asymptotic scaling properties, following the results presented in [6].

The duality 

In contrast to the images of generalized logarithms, which need not span $\mathbb{R}$ completely and can differ from one another, the domain of generalized logarithms is always all of $\mathbb{R}^+$. For these reasons one may classify generalized logarithms according to the minimum and maximum values of their images and consider the group $\mathcal{G}$ of order-preserving automorphisms on $\mathbb{R}^+$ that keep an infinitesimal neighborhood of $1 \in \mathbb{R}^+$ invariant, as the means to generate these classes. In the following we call the elements $g$ of this automorphism group scale transformations. More precisely, $g \in \mathcal{G}$ is a scale transformation if $g$ is differentiable and maps $\mathbb{R}^+$ to $\mathbb{R}^+$ one-to-one, $g' > 0$, $g(1) = 1$, and $g'(1) = 1$. From these properties it follows that $g(0) = 0$ and $\lim_{x \to \infty} g(x) = \infty$. Finally, we use the notation $f \circ g(x) = f(g(x))$.

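Scale transformations leave the image of a generalized logarithm invariant. This allows us to parametrize classes in the following way. Given a proper generalized logarithm $\Lambda \in \mathcal{L}$, we write for its maximum and minimum values

$$\overline{\Lambda} \equiv \max\{\Lambda(x) \,|\, x \in \mathbb{R}^+\} , \qquad \underline{\Lambda} \equiv \min\{\Lambda(x) \,|\, x \in \mathbb{R}^+\} , \tag{8}$$

and define two functionals

$$\nu_-[\Lambda] \equiv -\frac{1}{\overline{\Lambda}} \leq 0 , \qquad \nu_+[\Lambda] \equiv -\frac{1}{\underline{\Lambda}} \geq 0 , \tag{9}$$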

which associate numbers $\nu_+$ and $\nu_-$ to any $\Lambda$. For their sum we write $\nu^* = \nu_+ + \nu_-$. Next, we define sets of proper generalized logarithms,

$$\mathcal{L}_{\nu_+,\nu_-} = \{\Lambda \in \mathcal{L} \,|\, \nu_+ = \nu_+[\Lambda] \text{ and } \nu_- = \nu_-[\Lambda]\} . \tag{10}$$

Members of $\mathcal{L}_{\nu_+,\nu_-}$ all have the same maximum and minimum values. In fact, the $\mathcal{L}_{\nu_+,\nu_-}$ are exactly the equivalence classes in $\mathcal{L}$ generated by $\mathcal{G}$: Two generalized logarithms $\Lambda^{(A)}$ and $\Lambda^{(B)}$ are considered equivalent if there exists a scale transformation $g \in \mathcal{G}$ such that $\Lambda^{(B)} = \Lambda^{(A)} \circ g$. The space of generalized logarithms can be written as the union of these sets, $\mathcal{L} = \bigcup_{\nu_+,\nu_-} \mathcal{L}_{\nu_+,\nu_-}$.

With these definitions we now analyze the relation between the HT and TS approaches. Assuming that $\Lambda_{HT}$ is given, Eq. (7) implies

$$\Lambda_{TS,\nu}(x) = T_\nu \circ \Lambda_{HT}(x) . \tag{11}$$

Here $T_\nu(x) = (1/x + \nu)^{-1}$ is a shift operator with the property $T_\nu \circ T_\mu = T_{\nu+\mu}$. We have of course $\Lambda_{TS,0} = \Lambda_{HT}$. The fact that $\Lambda_{HT}$ is a proper generalized logarithm does not imply that $\Lambda_{TS,\nu}$ is also proper for all choices of $\nu$.
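The composition rule of the shift operator and its link to Eq. (7) can be checked numerically. The sketch below uses the definition $T_\nu(x) = (1/x + \nu)^{-1}$ from the supporting information with $k = 1$; the function name `shift` and the sample numbers are illustrative assumptions.

```python
def shift(nu, x):
    """Shift operator T_nu(x) = (1/x + nu)^(-1), written as x / (1 + nu*x) so that x = 0 is allowed."""
    return x / (1.0 + nu * x)

# y stands in for a value Lambda_HT(x); all numbers are arbitrary illustrations.
nu, mu, y = 0.3, -0.1, 0.8

composed = shift(nu, shift(mu, y))   # T_nu o T_mu
direct = shift(nu + mu, y)           # T_(nu+mu)
gap = 1.0 / shift(nu, y) - 1.0 / y   # Eq. (7) with k = 1: should equal nu
print(composed, direct, gap)
```

Writing $T_\nu$ as $x/(1+\nu x)$ makes it explicit that the only way $T_\nu \circ \Lambda$ can fail to be continuous is a zero of $1 + \nu\Lambda(x)$.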

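In fact, given that $\Lambda \in \mathcal{L}$, it can be shown (see supporting information) that $T_\nu \circ \Lambda \in \mathcal{L}$ if and only if $\nu_-[\Lambda] \leq \nu \leq \nu_+[\Lambda]$. Moreover, for $\Lambda \in \mathcal{L}_{\nu_+,\nu_-}$ and for $T_\nu \circ \Lambda$ being a proper generalized logarithm it follows that $T_\nu \circ \Lambda \in \mathcal{L}_{\nu_+ - \nu,\, \nu_- - \nu}$. As a consequence $\Lambda_{TS,\nu}(x) = T_\nu \circ \Lambda_{HT}(x)$ is proper only for $\nu_-[\Lambda_{HT}] \leq \nu \leq \nu_+[\Lambda_{HT}]$, and

$$\Lambda_{TS,\nu} \in \mathcal{L}_{\nu_+ - \nu,\, \nu_- - \nu} \;\Leftrightarrow\; \Lambda_{HT} \in \mathcal{L}_{\nu_+,\nu_-} . \tag{12}$$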

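This equation does not uniquely determine a duality relation $*$ on $\mathcal{L}$, yet by imposing the condition that $*$ commute with scale transformations $g \in \mathcal{G}$, it can be shown (see supporting information) that $*$ is given by

$$\Lambda^* = T_{\nu_+ + \nu_-} \circ \Lambda \quad \text{for } \Lambda \in \mathcal{L}_{\nu_+,\nu_-} , \tag{13}$$

with the property

$$\Lambda \in \mathcal{L}_{\nu_+,\nu_-} \;\Leftrightarrow\; \Lambda^* \in \mathcal{L}_{-\nu_-,-\nu_+} . \tag{14}$$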

Thus for each $\Lambda_{HT}$ there exists a unique value $\nu^* = \nu_+[\Lambda_{HT}] + \nu_-[\Lambda_{HT}]$ such that $\Lambda_{TS,\nu^*}$ is a proper generalized logarithm. The duality map $*$ gives $\Lambda_{TS,\nu^*} = \Lambda^*_{HT}$. Furthermore, since $*$ and $g$ commute ($(\Lambda \circ g)^* = \Lambda^* \circ g$), any proper generalized logarithm $\Lambda$ can be decomposed into a specific representative $\Lambda_{\nu_+,\nu_-} \in \mathcal{L}_{\nu_+,\nu_-}$ and a scale transformation $g$, so that

$$\Lambda = \Lambda_{\nu_+,\nu_-} \circ g . \tag{15}$$

This implies that any $\Lambda_{HT}$ or $\Lambda_{TS,\nu}$ can be decomposed in this way and that the dual logarithms, $\Lambda_{HT}$ and $\Lambda^*_{HT} = \Lambda_{TS,\nu^*}$, transform identically under scale transformations.
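As a concrete illustration of the duality, the following sketch takes a hypothetical $\Lambda_{HT}(x) = (1 - x^{-a})/a$, which is bounded above by $1/a$ and unbounded below, computes $\nu_\pm$ from its bounds as in Eq. (9), forms $\nu^* = \nu_+ + \nu_-$, and applies $T_{\nu^*}$. The choice of logarithm, the parameter `a`, and all function names are assumptions made for illustration.

```python
import math

def nu_from_bounds(lam_min, lam_max):
    """Return (nu_plus, nu_minus) = (-1/min, -1/max); an infinite bound gives 0."""
    nu_plus = 0.0 if math.isinf(lam_min) else -1.0 / lam_min
    nu_minus = 0.0 if math.isinf(lam_max) else -1.0 / lam_max
    return nu_plus, nu_minus

def shift(nu, y):
    """T_nu(y) = (1/y + nu)^(-1)."""
    return y / (1.0 + nu * y)

a = 0.5  # hypothetical parameter

def lam_ht(x):
    # Illustrative proper logarithm: Lambda(1) = 0, Lambda'(1) = 1, sup = 1/a, inf = -infinity.
    return (1.0 - x ** (-a)) / a

nu_plus, nu_minus = nu_from_bounds(-math.inf, 1.0 / a)   # class (0, -a)
nu_star = nu_plus + nu_minus

def lam_dual(x):
    # Lambda* = T_{nu*} o Lambda, Eq. (13); here it equals (x**a - 1)/a.
    return shift(nu_star, lam_ht(x))

# The dual is bounded below by -1/a and unbounded above, i.e. class (a, 0) = (-nu_minus, -nu_plus).
dual_plus, dual_minus = nu_from_bounds(-1.0 / a, math.inf)
print(nu_star, dual_plus, dual_minus)
```

Applying the same recipe to the dual, with $\nu^* = a$, returns the original logarithm, in line with $**$ being the identity.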

The functional form of the generalized logarithms 

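Equation (28) implies the existence of transformations that map members of $\mathcal{L}_{\nu_+,\nu_-}$ to members of $\mathcal{L}_{-\nu_-,-\nu_+}$. These maps can be used to represent the duality $*$ on specific families $\Lambda_{\nu_+,\nu_-} \in \mathcal{L}_{\nu_+,\nu_-}$. The map $\Lambda(x) \mapsto -\Lambda(1/x)$ is exactly such a map, since $\max\{-\Lambda(1/x) \,|\, x \in \mathbb{R}^+\} = \max\{-\Lambda(x) \,|\, x \in \mathbb{R}^+\} = -\min\{\Lambda(x) \,|\, x \in \mathbb{R}^+\} = -\underline{\Lambda}$. The same holds for $\min\{-\Lambda(1/x) \,|\, x \in \mathbb{R}^+\} = -\overline{\Lambda}$. This allows us to construct $\Lambda_{\nu_+,\nu_-}$ with the properties

$$\Lambda^*_{\nu_+,\nu_-}(x) = \Lambda_{-\nu_-,-\nu_+}(x) = -\Lambda_{\nu_+,\nu_-}(1/x) . \tag{16}$$

By using $\nu^* = \nu_+ + \nu_-$, Eq. (27), and inserting $\Lambda^*_{\nu_+,\nu_-}(x) = -\Lambda_{\nu_+,\nu_-}(1/x)$ into Eq. (7), we get

$$\frac{1}{\Lambda_{\nu_+,\nu_-}(1/x)} + \frac{1}{\Lambda_{\nu_+,\nu_-}(x)} = -(\nu_+ + \nu_-) . \tag{17}$$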

This equation may have many solutions $\Lambda_{\nu_+,\nu_-}$, but we can restrict ourselves to finding a particular one. All the others can be obtained by scale transformations. This is seen as follows: Suppose $\Lambda_{\nu_+,\nu_-}$ and $\Lambda'_{\nu_+,\nu_-}$ are both solutions of Eq. (17); then according to Eq. (15) for any pair $(\nu_+, \nu_-)$ there exists a scale transformation $g_{\nu_+,\nu_-}$ such that $\Lambda'_{\nu_+,\nu_-} = \Lambda_{\nu_+,\nu_-} \circ g_{\nu_+,\nu_-}$. Since $g_{\nu_+,\nu_-}$ must leave Eq. (16) invariant (this is not the case for arbitrary scale transformations $g \in \mathcal{G}$), these scale transformations have two properties. The first is $g_{\nu_+,\nu_-}(x)\, g_{\nu_+,\nu_-}(1/x) = 1$, which makes them members of a subgroup $\mathcal{G}_0 \subset \mathcal{G}$ of all possible scale transformations $g \in \mathcal{G}$. The second property is $g_{-\nu_-,-\nu_+} = g_{\nu_+,\nu_-}$ and follows from the fact that $*$ commutes with scale transformations. A particular solution of Eq. (17) is given by



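$$\Lambda_{\nu_+,\nu_-}(x) = \left[ -\frac{\nu_+ + \nu_-}{2} + \frac{\nu_+ - \nu_-}{2}\, \frac{1}{h\!\left( \frac{\nu_+ - \nu_-}{2} \log(x) \right)} \right]^{-1} , \tag{18}$$

with $h : \mathbb{R} \to [-1, 1]$ a continuous, monotonically increasing, odd function, with $\lim_{x \to \infty} h(x) = 1$ and $h'(0) = 1$. It can easily be verified that this solution has all the required properties: $\Lambda_{\nu_+,\nu_-}$ is a proper logarithm with $\Lambda_{\nu_+,\nu_-} \in \mathcal{L}_{\nu_+,\nu_-}$ (correct minimum and maximum), $\Lambda^*_{\nu_+,\nu_-}(x) = -\Lambda_{\nu_+,\nu_-}(1/x) = \Lambda_{-\nu_-,-\nu_+}(x)$, $\Lambda_{\nu,-\nu}(x) = h(\nu \log(x))/\nu$ is self-dual, and $\lim_{\nu_+ \to 0} \lim_{\nu_- \to 0} \Lambda_{\nu_+,\nu_-}(x) = \log(x)$.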
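The listed properties can be checked numerically for a particular $h$. The sketch below is one way to do this, assuming the form $1/\Lambda_{\nu_+,\nu_-}(x) = -(\nu_+ + \nu_-)/2 + [(\nu_+ - \nu_-)/2]/h([(\nu_+ - \nu_-)/2]\log x)$ for Eq. (18) and the arctan-based $h$ that appears in Example 4; the helper names and parameter values are hypothetical, and the code requires $\nu_+ \neq \nu_-$.

```python
import math

def make_log(h, nu_plus, nu_minus):
    """Build Lambda_{nu+,nu-} from h, assuming the form of Eq. (18); needs nu_plus != nu_minus."""
    a = 0.5 * (nu_plus - nu_minus)
    b = 0.5 * (nu_plus + nu_minus)
    def lam(x):
        if x == 1.0:
            return 0.0  # h(0) = 0 makes 1/Lambda diverge, so Lambda(1) = 0
        return 1.0 / (-b + a / h(a * math.log(x)))
    return lam

def h_arctan(y):
    # Odd, increasing, h'(0) = 1, tends to +/-1 at +/-infinity.
    return (2.0 / math.pi) * math.atan(0.5 * math.pi * y)

def shift(nu, y):
    return y / (1.0 + nu * y)

nu_plus, nu_minus = 0.4, -0.8                          # hypothetical values
lam = make_log(h_arctan, nu_plus, nu_minus)
lam_swapped = make_log(h_arctan, -nu_minus, -nu_plus)  # Lambda_{-nu-,-nu+}

for x in (0.2, 0.7, 3.0):
    eq17 = 1.0 / lam(1.0 / x) + 1.0 / lam(x)          # should equal -(nu+ + nu-)
    dual = shift(nu_plus + nu_minus, lam(x))           # T_{nu*} o Lambda
    print(x, eq17, dual, -lam(1.0 / x), lam_swapped(x))
```

For these parameters the values stay strictly between $-1/\nu_+ = -2.5$ and $-1/\nu_- = 1.25$, as required for membership in $\mathcal{L}_{\nu_+,\nu_-}$.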

This means that we can generate a specific family of logarithms $\Lambda_{\nu_+,\nu_-}$, following Eq. (16), by choosing one particular function $h$ (for instance $h(x) = \tanh(x)$) and then using scale transformations to reach all other possibilities. In particular, some family $\Lambda'_{\nu_+,\nu_-}$ with the property $(\Lambda'_{\nu_+,\nu_-})^*(x) = \Lambda'_{-\nu_-,-\nu_+}(x)$ can be reached by a family of scale transformations $g_{\nu_+,\nu_-} = \mathcal{E}_{\nu_+,\nu_-} \circ \Lambda'_{\nu_+,\nu_-} \in \mathcal{G}$, where $\mathcal{E}_{\nu_+,\nu_-} = \Lambda^{-1}_{\nu_+,\nu_-}$ are generalized exponential functions (inverse functions of logarithms). Moreover, if $\Lambda'_{\nu_+,\nu_-}$ also follows Eq. (16), then $g_{\nu_+,\nu_-} \in \mathcal{G}_0$.

The family of dual logarithms discussed in [7] is obtained in the framework presented here by setting either $\nu_+ = 0$ or $\nu_- = 0$. These classes correspond to logarithms that are unbounded either from below or from above, while the duality maps $\mathcal{L}_{\nu,0} \leftrightarrow \mathcal{L}_{0,-\nu}$. Moreover, in [7] only pairs of dual logarithms have been considered such that $\Lambda^*(x) = -\Lambda(1/x)$, and the part scale transformations play in the unique definition of $*$ had not yet been described.

We are now in a position to understand all observable distribution functions emerging from the two approaches in terms of a single two-parameter family of generalized logarithms $\Lambda_{\nu_+,\nu_-}$ and a scale transformation. This result now raises the question of how $\Lambda_{\nu_+,\nu_-}$ is related to the two-parameter logarithms associated with the $(c, d)$-entropies in Eq. (1) [6]. That will further clarify the role of the scale transformations.



The $\Lambda_{\nu_+,\nu_-}$ logarithm and $(c, d)$-entropy

Generalized entropies can be classified with respect to their asymptotic scaling behavior in terms of two scaling exponents $c$ and $d$, where $0 < c \leq 1$ and $d$ is a real number [6]. They are obtained from the scaling relations

$$\lambda^{c} = \lim_{x \to 0} \frac{s(\lambda x)}{s(x)} , \qquad (1 + a)^{d} = \lim_{x \to 0} \frac{s(x^{1+a})}{x^{ac}\, s(x)} , \tag{19}$$

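where $s$ is the summand in Eq. (2). Using Eqs. (19), de l'Hôpital's rule, and the fact that $s'(x) = -\Lambda(x)$, we find the exponents $(c, d)$ for a given $\Lambda \in \mathcal{L}$

$$\lambda^{c-1} = \lim_{x \to 0} \frac{\Lambda(\lambda x)}{\Lambda(x)} , \qquad (1 + a)^{d} = \lim_{x \to 0} \frac{\Lambda(x^{1+a})}{x^{a(c-1)}\, \Lambda(x)} , \tag{20}$$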

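where we represent $\Lambda$ as $\Lambda = \Lambda_{\nu_+,\nu_-} \circ g$. In this way we get the dependence of $(c, d)$ as a function of $(\nu_+, \nu_-)$, $h$, and the scale transformation $g$. We first compute the asymptotic properties of $h$ and $g$, defining the exponents $c_{h,g}$ and $d_{h,g}$ by

$$\lambda^{c_{h,g}} = \lim_{x \to 0} \frac{\phi_{h,g}(\lambda x)}{\phi_{h,g}(x)} , \qquad (1 + a)^{d_{h,g}} = \lim_{x \to 0} \frac{\phi_{h,g}(x^{1+a})}{x^{a c_{h,g}}\, \phi_{h,g}(x)} , \tag{21}$$

where $\phi_{h,g}(x) = 1 + h \circ \log \circ g(x)$. Note that $\log \circ g \in \mathcal{L}_{0,0}$. By defining $\Lambda_0 = \log \circ g$ we compute its scaling exponents $c_0$ and $d_0$

$$\lambda^{c_0 - 1} = \lim_{x \to 0} \frac{\Lambda_0(\lambda x)}{\Lambda_0(x)} , \qquad (1 + a)^{d_0} = \lim_{x \to 0} \frac{\Lambda_0(x^{1+a})}{x^{a(c_0 - 1)}\, \Lambda_0(x)} . \tag{22}$$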

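With these preparations one can derive the results

$$c = \begin{cases} 1 & \text{for } \nu_+ \neq 0 , \\ 1 - c_{h,g} & \text{for } \nu_+ = 0 \text{ and } c_0 \neq 1 , \\ 1 - c_{h,g} \left( -\frac{\nu_-}{2} \right)^{1/d_0} & \text{for } \nu_+ = 0 \text{ and } c_0 = 1 , \end{cases} \qquad d = \begin{cases} 0 & \text{for } \nu_+ \neq 0 , \\ -d_{h,g} & \text{for } \nu_+ = 0 . \end{cases} \tag{23}$$

This demonstrates clearly that, given a fixed $h$, $c$ is controlled by $\nu_-$ (for $\nu_+ = 0$ and $c_0 = 1$), while $d$ is determined by the scale transformation.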
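The first scaling relation of Eq. (20) can be probed numerically by evaluating $\Lambda(\lambda x)/\Lambda(x)$ at a small but finite $x$. The sketch below does this for the $\tanh$-based logarithm of Eq. (24) with trivial scale transformation; the evaluation point `x=1e-12`, the factor `scale=2.0`, and the parameter values are assumptions, and the finite-$x$ estimate only approximates the limit.

```python
import math

def lam_tanh(x, nu_plus, nu_minus):
    """Generalized logarithm for h = tanh, cf. Eq. (24)."""
    u = x ** (nu_plus - nu_minus)
    return (u - 1.0) / (nu_plus - nu_minus * u)

def estimate_c(lam, x=1e-12, scale=2.0):
    """Finite-x estimate of c from scale^(c-1) ~ Lambda(scale*x) / Lambda(x)."""
    ratio = lam(scale * x) / lam(x)
    return 1.0 + math.log(ratio) / math.log(scale)

# nu_plus = 0: power-law (q-logarithm) case; direct calculation gives c = 1 + nu_minus.
c_power = estimate_c(lambda x: lam_tanh(x, 0.0, -0.5))
# nu_plus != 0: Lambda is bounded from below, so the ratio tends to 1 and c = 1.
c_bounded = estimate_c(lambda x: lam_tanh(x, 0.4, -0.8))
print(c_power, c_bounded)
```

The two estimates illustrate the first and third cases of Eq. (23): $c = 1$ whenever $\nu_+ \neq 0$, and a value of $c$ set by $\nu_-$ when $\nu_+ = 0$.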

Examples 

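Example 1 - A simple choice for h: Fix $h(x) = \tanh(x)$. From Eq. (18) we get for the generalized logarithm

$$\Lambda_{\nu_+,\nu_-}(x) = \frac{x^{\nu_+ - \nu_-} - 1}{\nu_+ - \nu_-\, x^{\nu_+ - \nu_-}} . \tag{24}$$

The associated generalized exponential (inverse of the generalized logarithm) is

$$\mathcal{E}_{\nu_+,\nu_-}(x) = \left( \frac{1 + \nu_+ x}{1 + \nu_- x} \right)^{\frac{1}{\nu_+ - \nu_-}} . \tag{25}$$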
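A quick way to validate the closed forms of Eqs. (24) and (25) is to check that they invert each other and that Eq. (24) agrees with the general expression of Eq. (18) evaluated with $h = \tanh$. The function names and parameter values below are illustrative assumptions, and the form used for Eq. (18) is the one in which $1/\Lambda = -(\nu_+ + \nu_-)/2 + [(\nu_+ - \nu_-)/2]/h(\cdot)$.

```python
import math

def lam_tanh(x, nu_plus, nu_minus):
    """Closed form of Eq. (24)."""
    u = x ** (nu_plus - nu_minus)
    return (u - 1.0) / (nu_plus - nu_minus * u)

def exp_tanh(y, nu_plus, nu_minus):
    """Generalized exponential of Eq. (25); valid for -1/nu_plus < y < -1/nu_minus."""
    return ((1.0 + nu_plus * y) / (1.0 + nu_minus * y)) ** (1.0 / (nu_plus - nu_minus))

def lam_general(x, nu_plus, nu_minus):
    """Eq. (18) with h = tanh (x != 1), for comparison with the closed form."""
    a = 0.5 * (nu_plus - nu_minus)
    b = 0.5 * (nu_plus + nu_minus)
    return 1.0 / (-b + a / math.tanh(a * math.log(x)))

nu_plus, nu_minus = 0.4, -0.8   # hypothetical values
for x in (0.3, 2.0, 10.0):
    y = lam_tanh(x, nu_plus, nu_minus)
    print(x, y, exp_tanh(y, nu_plus, nu_minus), lam_general(x, nu_plus, nu_minus))
```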



Example 2 - Power laws: By setting $h(x) = \tanh(x)$ and $\nu_+ = 0$, we get from Eq. (24) the so-called $q$-logarithm, with $\Lambda_{0,\nu_-}(x) = \log_q(x) \equiv (x^{1-q} - 1)/(1 - q)$, where $q = 1 - \nu_-$. The dual is $\Lambda^*_{0,\nu_-}(x) = (x^{q-1} - 1)/(q - 1) = \log_{2-q}(x)$, and we recover the well known duality for $q$-logarithms. It is also well known that $\log_q$ results from the use of escort distributions [12] [13] [10], while $\log_{2-q}$ is a natural result of the HT approach [8] [9].

An example of a generalized logarithm that is not a power is obtained by taking $\nu_- = -\nu_+$ in Eq. (24). One obtains $\Lambda_{\nu_+,-\nu_+}(x) = (x^{2\nu_+} - 1)/(\nu_+ (1 + x^{2\nu_+}))$, with the dual $\Lambda^*_{\nu_+,-\nu_+} = \Lambda_{\nu_+,-\nu_+}$.
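Both statements of this example can be checked with the inversion map $\Lambda(x) \mapsto -\Lambda(1/x)$ of Eq. (16). The sketch below assumes the standard definition $\log_q(x) = (x^{1-q} - 1)/(1 - q)$ and uses arbitrary sample values for $q$ and $\nu$; the function names are hypothetical.

```python
import math

def log_q(x, q):
    """q-logarithm, (x^(1-q) - 1)/(1 - q), with the ordinary logarithm at q = 1."""
    if q == 1.0:
        return math.log(x)
    return (x ** (1.0 - q) - 1.0) / (1.0 - q)

def lam_self_dual(x, nu):
    """Lambda_{nu,-nu}(x) = tanh(nu log x)/nu = (x^(2 nu) - 1)/(nu (1 + x^(2 nu)))."""
    return math.tanh(nu * math.log(x)) / nu

q, nu = 1.5, 0.7   # hypothetical values
xs = [0.2, 1.0, 4.0]

# q <-> 2-q duality: -log_q(1/x) = log_{2-q}(x)
inversion_gap = max(abs(-log_q(1.0 / x, q) - log_q(x, 2.0 - q)) for x in xs)
# self-duality: -Lambda(1/x) = Lambda(x)
self_dual_gap = max(abs(-lam_self_dual(1.0 / x, nu) - lam_self_dual(x, nu)) for x in xs)
print(inversion_gap, self_dual_gap)
```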



Example 3 - Scale transformations: Any proper generalized logarithm can be written as a composition of a representative logarithm from Eq. (18) and a scale transformation, with $\Lambda = \Lambda_{\nu_+,\nu_-} \circ g$. For example pick $\Lambda_{0,0}(x) = \log(x)$, and $g_d(x) = \exp[1 - (1 - d \log(x))^{1/d}]$, where $d > 0$ is a parameter of $g$. The generalized logarithm then becomes

$$\Lambda(x) = \Lambda_{0,0}(g_d(x)) = 1 - (1 - d \log(x))^{1/d} . \tag{26}$$

The associated generalized exponential is a stretched exponential, $\mathcal{E}_d(x) = \exp\left( -\frac{1}{d} \left[ (1 - x)^d - 1 \right] \right)$, which is the known result for $(c, d)$-entropies with $c = 1$ and $d > 0$ [6] [14].
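The pair formed by the logarithm of Eq. (26) and the stretched exponential can be verified by a round trip. The sketch assumes the exponent $1/d$ in $g_d$ and Eq. (26), which is the choice that makes the stated $\mathcal{E}_d$ its inverse; function names and sample values are illustrative.

```python
import math

def lam_d(x, d):
    """Lambda(x) = 1 - (1 - d log x)^(1/d), cf. Eq. (26); needs x <= exp(1/d)."""
    return 1.0 - (1.0 - d * math.log(x)) ** (1.0 / d)

def exp_d(y, d):
    """Stretched exponential E_d(y) = exp(-[(1 - y)^d - 1]/d); needs y <= 1."""
    return math.exp(-((1.0 - y) ** d - 1.0) / d)

d = 0.5   # hypothetical value of the parameter d > 0
for x in (0.01, 0.5, 1.0, 3.0):
    y = lam_d(x, d)
    print(x, y, exp_d(y, d))
```

For $d = 1$ the scale transformation is the identity and the ordinary logarithm is recovered.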

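Example 4 - Different choices for h: Suppose that a physical situation demands a specific $\Lambda$ and two observers A and B choose to represent $\Lambda$ differently. Observer A chooses $h_A(x) = \frac{2}{\pi} \arctan(\frac{\pi}{2} x)$ to represent $\Lambda$, so that $\Lambda = \Lambda^{(A)}_{\nu_+,\nu_-} \circ g^{(A)}$, and observer B chooses $h_B(x) = \tanh(x)$ to represent $\Lambda = \Lambda^{(B)}_{\nu_+,\nu_-} \circ g^{(B)}$. Then $\Lambda^{(A)}_{\nu_+,\nu_-}$ and $\Lambda^{(B)}_{\nu_+,\nu_-}$ can only differ by a scale transformation $g_\nu \in \mathcal{G}_0$ with $\nu = \nu_+ - \nu_-$, and it follows that $g_\nu(x) = \exp\left[ \frac{2}{\nu}\, h_A^{-1} \circ h_B\left( \frac{\nu}{2} \log(x) \right) \right]$. For the particular functions $h_A$ and $h_B$ we have chosen, we get $g_\nu(x) = \exp\left[ \frac{4}{\pi\nu} \tan\left( \frac{\pi}{2}\, \frac{x^\nu - 1}{x^\nu + 1} \right) \right]$.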
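The scale transformation relating the two observers can be tested directly. The sketch below assumes the form of Eq. (18) with $1/\Lambda = -(\nu_+ + \nu_-)/2 + [(\nu_+ - \nu_-)/2]/h(\cdot)$, takes $\nu = \nu_+ - \nu_-$, and checks that $\Lambda^{(A)} \circ g_\nu = \Lambda^{(B)}$ and that $g_\nu(x)\, g_\nu(1/x) = 1$, the defining property of $\mathcal{G}_0$. Helper names and parameter values are hypothetical.

```python
import math

def h_a(y):
    return (2.0 / math.pi) * math.atan(0.5 * math.pi * y)

def make_log(h, nu_plus, nu_minus):
    """Lambda_{nu+,nu-} built from h, assuming the form of Eq. (18)."""
    a = 0.5 * (nu_plus - nu_minus)
    b = 0.5 * (nu_plus + nu_minus)
    def lam(x):
        if x == 1.0:
            return 0.0
        return 1.0 / (-b + a / h(a * math.log(x)))
    return lam

def g_nu(x, nu):
    """g_nu(x) = exp[(4/(pi nu)) tan((pi/2)(x^nu - 1)/(x^nu + 1))]."""
    t = (x ** nu - 1.0) / (x ** nu + 1.0)   # equals tanh((nu/2) log x)
    return math.exp(4.0 / (math.pi * nu) * math.tan(0.5 * math.pi * t))

nu_plus, nu_minus = 0.4, -0.8   # hypothetical values
nu = nu_plus - nu_minus
lam_a = make_log(h_a, nu_plus, nu_minus)
lam_b = make_log(math.tanh, nu_plus, nu_minus)

for x in (0.5, 2.0, 3.0):
    print(x, lam_a(g_nu(x, nu)), lam_b(x), g_nu(x, nu) * g_nu(1.0 / x, nu))
```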

Discussion 

By studying the two types of entropies that are related to the two possible ways to formulate a maximum entropy principle for systems that explicitly violate the fourth SK axiom, we find that there exists a unique duality that relates the two entropies. Consequently thermodynamic properties derived from those two entropies will also be related through the duality. We show that the maximum and minimum of $\Lambda_{HT}$ determine a unique value $\nu^*$ for which $\Lambda_{TS,\nu^*}$ is the dual of $\Lambda_{HT}$. In this way it is possible for an object such as $\Lambda_{HT}$, which does not explicitly carry an index $\nu$, to become dual to an object that does, such as $\Lambda_{TS,\nu}$. The existence of this duality opens the way to characterizing all possible generalized logarithms as compositions of a specific functional form $\Lambda_{\nu_+,\nu_-}$ and scale transformations $g$. We derive the explicit form of $\Lambda_{\nu_+,\nu_-}$ and show that these logarithms are one-to-one related to two asymptotic scaling exponents $(c, d)$ which allow one to characterize strongly non-ergodic or non-Markovian systems in their thermodynamic limit [6]. $\nu_-$ is shown to be directly related to $c$, while the form of the scale transformation $g$ determines $d$. In summary, we provide a complete theory of all generalized logarithms that can arise as a consequence of the violation of the fourth SK axiom.
tional Science Foundation, grant # 1066293 and the hospitality of
the Aspen Center for Physics. 
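Theorem 1 Let $\Lambda \in \mathcal{L}$. Then $T_\nu \circ \Lambda \in \mathcal{L}$ if and only if $\nu_-[\Lambda] \leq \nu \leq \nu_+[\Lambda]$. Moreover, if $\Lambda \in \mathcal{L}_{\nu_+,\nu_-}$ and $T_\nu \circ \Lambda$ is a proper generalized logarithm, then $T_\nu \circ \Lambda \in \mathcal{L}_{\nu_+ - \nu,\, \nu_- - \nu}$.

Proof of Theorem 1. We recall that $T_\nu$ is defined as $T_\nu(x) = (1/x + \nu)^{-1}$. If there exists an $x > 0$ such that $\Lambda(x) = -1/\nu$ then $T_\nu \circ \Lambda(x)$ possesses a pole where $T_\nu \circ \Lambda(x)$ changes sign. In that case $T_\nu \circ \Lambda$ is neither a continuous nor a monotonically increasing function and therefore not a proper generalized logarithm. Conversely, if no $x > 0$ exists such that $\Lambda(x) = -1/\nu$, then $T_\nu \circ \Lambda$ has no pole and is a continuous monotonically increasing function since $T'_\nu(x) = (1 + \nu x)^{-2} > 0$ for all $x$. Moreover, $T_\nu \circ \Lambda(1) = 0$ and $(T_\nu \circ \Lambda)'(1) = T'_\nu(0) \Lambda'(1) = 1$. It follows that $T_\nu \circ \Lambda \in \mathcal{L}$. In order to find sufficient conditions for $T_\nu \circ \Lambda$ to have no such pole, we first look at the case $\nu > 0$. In that case no pole exists if $\underline{\Lambda} = \min\{\Lambda(x) \,|\, x \in \mathbb{R}^+\} \geq -1/\nu$. In other words, $\nu \leq -1/\underline{\Lambda} = \nu_+[\Lambda]$. Now we turn to the case $\nu < 0$. Then no pole exists if $\overline{\Lambda} = \max\{\Lambda(x) \,|\, x \in \mathbb{R}^+\} \leq -1/\nu$, which is to say $\nu \geq -1/\overline{\Lambda} = \nu_-[\Lambda]$. Both cases together show that $T_\nu \circ \Lambda$ is a continuous function only if $\nu_-[\Lambda] \leq \nu \leq \nu_+[\Lambda]$. Finally, if $T_\nu \circ \Lambda$ is continuous, then $\max\{T_\nu \circ \Lambda(x) \,|\, x \in \mathbb{R}^+\} = T_\nu(\max\{\Lambda(x) \,|\, x \in \mathbb{R}^+\})$, i.e. $\overline{T_\nu \circ \Lambda} = T_\nu(\overline{\Lambda})$. Analogously we find $\underline{T_\nu \circ \Lambda} = T_\nu(\underline{\Lambda})$. From this $\nu_-[T_\nu \circ \Lambda] = -1/T_\nu(\overline{\Lambda}) = -1/\overline{\Lambda} - \nu = \nu_-[\Lambda] - \nu$ follows. An analogous relation holds for $\nu_+$, which completes the proof.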
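Theorem 1 can be illustrated numerically on a finite grid. The sketch below uses the $\tanh$-based logarithm of Eq. (24) with hypothetical parameters, detects a pole of $T_\nu \circ \Lambda$ as a sign change of $1 + \nu\Lambda(x)$, and estimates the class of the shifted logarithm from its values at the ends of the grid. The grid and all names are assumptions, and a finite grid only approximates the extreme values.

```python
NU_PLUS, NU_MINUS = 0.4, -0.8   # hypothetical class parameters

def lam(x):
    """Example logarithm in the class (NU_PLUS, NU_MINUS), cf. Eq. (24)."""
    u = x ** (NU_PLUS - NU_MINUS)
    return (u - 1.0) / (NU_PLUS - NU_MINUS * u)

GRID = [10.0 ** (k / 10.0) for k in range(-60, 61)]   # 1e-6 ... 1e6

def has_pole(nu):
    """True if 1 + nu*Lambda(x) changes sign on the grid, i.e. T_nu o Lambda has a pole."""
    values = [1.0 + nu * lam(x) for x in GRID]
    return min(values) < 0.0 < max(values)

def shifted_class(nu):
    """Estimate (nu_plus, nu_minus) of T_nu o Lambda from the grid end points."""
    lo, hi = lam(GRID[0]), lam(GRID[-1])
    t_lo, t_hi = lo / (1.0 + nu * lo), hi / (1.0 + nu * hi)
    return -1.0 / t_lo, -1.0 / t_hi

print(has_pole(0.2), has_pole(0.6), shifted_class(0.1))
```

For $\nu = 0.1$ the estimated class is close to $(\nu_+ - \nu, \nu_- - \nu) = (0.3, -0.9)$, while values of $\nu$ outside $[\nu_-, \nu_+]$ produce a pole.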

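Theorem 2 Suppose a map $*$ is given on $\mathcal{L}$ with the properties (i) $* \circ *$ is the identity map, (ii) for each $\Lambda \in \mathcal{L}$ there exists a $\nu^*$ such that $\Lambda^* = T_{\nu^*} \circ \Lambda$, and (iii) $*$ commutes with scale transformations $g \in \mathcal{G}$ (that is $(\Lambda \circ g)^* = \Lambda^* \circ g$). Then $*$ is uniquely determined and $\nu^*$ is given by

$$\nu^* = \nu_+ + \nu_- , \quad \text{for } \Lambda \in \mathcal{L}_{\nu_+,\nu_-} . \tag{27}$$

Furthermore it follows from theorem (1) that

$$\Lambda \in \mathcal{L}_{\nu_+,\nu_-} \;\Leftrightarrow\; \Lambda^* \in \mathcal{L}_{-\nu_-,-\nu_+} . \tag{28}$$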

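Proof of Theorem 2. The duality $*$ on $\mathcal{L}$ can be constructed in the following way. From property (ii) stated in theorem (2) we know there exists a functional $F : \mathcal{L} \to \mathbb{R}$ such that $\Lambda^* = T_{F[\Lambda]} \circ \Lambda$. From properties (i) and (ii) we also know that $\Lambda = T_{F[\Lambda^*]} \circ \Lambda^*$. Theorem (1) states that given $\Lambda \in \mathcal{L}_{\nu_+,\nu_-}$ the condition $\nu_+ \geq F[\Lambda] \geq \nu_-$ is necessary for $\Lambda^*$ to be a proper logarithm. As a consequence we get $T_{F[\Lambda]} \circ \Lambda \in \mathcal{L}_{\nu_+ - F[\Lambda],\, \nu_- - F[\Lambda]}$. This further implies $\nu_+ - F[\Lambda] \geq F[\Lambda^*] \geq \nu_- - F[\Lambda]$. Property (iii) implies that for any two logarithms $\Lambda_1$ and $\Lambda_2$ that are members of the same class $\mathcal{L}_{\nu_+,\nu_-}$, we get $F[\Lambda_1] = F[\Lambda_2]$. Therefore $F$ can only be of the form

$$F[\Lambda] = f(\nu_+, -\nu_-) \quad \text{for } \Lambda \in \mathcal{L}_{\nu_+,\nu_-} , \tag{29}$$

where $f : \mathbb{R}^+ \times \mathbb{R}^+ \to \mathbb{R}$. Using $T_\nu \circ T_\mu = T_{\nu+\mu}$ together with property (i) leads to

$$f(\nu_+, -\nu_-) = -f\big( \nu_+ - f(\nu_+, -\nu_-),\, -(\nu_- - f(\nu_+, -\nu_-)) \big) . \tag{30}$$

In other words, $f$ solves the two equations

$$\text{(a)} \quad f(x, y) = -f\big( x - f(x, y),\, y + f(x, y) \big) , \qquad \text{(b)} \quad x \geq f(x, y) \geq -y , \tag{31}$$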

for all $x, y \in \mathbb{R}^+$. Eq. (31b) immediately tells us that $f(0, 0) = 0$. Consider a function $y(x, z)$ solving the implicit equation

$$f(x, y(x, z)) = z , \tag{32}$$

and rewrite Eq. (31a) as

$$f(x - z, y(x, z) + z) = -z . \tag{33}$$

From Eqs. (32) and (33) one gets $y(x - z, -z) = y(x, z) + z$. Also, $f(0, 0) = 0$ implies $y(0, 0) = 0$. By expanding $y(x, z) = \sum_{m,n=0}^{\infty} y_{m,n} x^m z^n$ one gets $y_{0,0} = 0$ and for the first order

$$2 y_{0,1} + y_{1,0} + 1 = 0 . \tag{34}$$

All coefficients $y_{m,n}$ of higher order follow the equation

$$y_{m,n} = (-1)^n \sum_{k=0}^{n} \binom{m+k}{k}\, y_{m+k,\, n-k} . \tag{35}$$

We also expand $f(x, y(x, z)) = \sum_{m,n} f_{m,n} x^m y(x, z)^n = \sum_{i,j} \phi_{i,j} x^i z^j = z$. It follows that all $\phi_{i,j} = 0$ except for $\phi_{0,1} = 1$. Since $y_{0,0} = 0$ all terms $f_{m,n} x^m y(x, z)^n$ can only contribute to coefficients $\phi_{i,j}$ with $i + j \geq m + n$. Comparing coefficients order by order one shows that only coefficients of the first order

$$f_{1,0}\, x + f_{0,1}\, y(x, z) = z , \tag{36}$$

contribute to solving Eq. (32). Thus $y(x, z)$ can only be of the form $y(x, z) = y_{1,0} x + y_{0,1} z$. This, together with Eqs. (34) and (31b), implies $x \geq 2 (y_{1,0} x - y)/(1 + y_{1,0}) \geq -y$. Choosing $x = 0$ gives $y_{1,0} \geq 1$. Setting $y = 0$ then implies $y_{1,0} \leq 1$, so that the only possible solution for $y_{1,0}$ is $y_{1,0} = 1$. As a consequence of Eq. (34), $y_{0,1} = -1$. Therefore we have $y(x, z) = x - z$ and $f$ has the unique solution $f(x, y) = x - y$. From this it follows that $\nu^* = f(\nu_+, -\nu_-) = \nu_+ + \nu_-$ for $\Lambda \in \mathcal{L}_{\nu_+,\nu_-}$. This means that $\nu^*$ is uniquely defined. Since $\nu_+ - \nu^* = -\nu_-$ and $\nu_- - \nu^* = -\nu_+$, theorem (1) implies that $\Lambda \in \mathcal{L}_{\nu_+,\nu_-} \Leftrightarrow \Lambda^* \in \mathcal{L}_{-\nu_-,-\nu_+}$. This completes the proof.
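The solution $f(x, y) = x - y$ can be checked against the two conditions of Eq. (31) on sample points. In the sketch below the bounds in (31b) are read as non-strict, and the alternative `f_half` is a hypothetical candidate included only to show that the functional equation (31a) is restrictive; the sample points are arbitrary.

```python
def f_star(x, y):
    """Solution found in the proof: f(x, y) = x - y."""
    return x - y

def f_half(x, y):
    # Hypothetical alternative, for contrast only; it violates (31a).
    return 0.5 * (x - y)

def satisfies_31(f, points, tol=1e-12):
    """Check (a) f(x,y) = -f(x - f, y + f) and (b) x >= f(x,y) >= -y on the given points."""
    for x, y in points:
        fxy = f(x, y)
        if abs(fxy + f(x - fxy, y + fxy)) > tol:
            return False
        if not (x + tol >= fxy >= -y - tol):
            return False
    return True

points = [(0.0, 0.0), (0.4, 0.8), (1.5, 0.0), (0.0, 2.0), (3.0, 1.0)]
nu_plus, nu_minus = 0.4, -0.8                 # hypothetical class parameters
nu_star = f_star(nu_plus, -nu_minus)           # equals nu_plus + nu_minus
print(satisfies_31(f_star, points), satisfies_31(f_half, points), nu_star)
```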



